Load balancing of irregular parallel divide-and-conquer algorithms in group-SPMD programming environments

نویسندگان

  • Mattias V. Eriksson
  • Christoph W. Kessler
  • Mikhail Chalabine
چکیده

We study strategies for local load balancing of irregular parallel divide-andconquer algorithms such as Quicksort and Quickhull in SPMD-parallel environments such as MPI and Fork that allow to exploit nested parallelism by dynamic group splitting. We propose two new local strategies, repivoting and serialisation, and develop a hybrid local load balancing strategy, which is calibrated by parameters that are derived off-line from a dynamic programming optimisation. While the approach is generic, we have implemented and evaluated our method for two very different parallel platforms. We found that our local strategy is superior to global dynamic load balancing on a Linux cluster, while the latter performs better on a tightly synchronised sharedmemory platform with nonblocking, cheap task queue access.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical Work Stealing on Manycore Clusters

Partitioned Global Address Space languages like UPC offer a convenient way of expressing large shared data structures, especially for irregular structures that require asynchronous random access. But the static SPMD parallelism model of UPC does not support divide and conquer parallelism or other forms of dynamic parallelism. We introduce a dynamic tasking library for UPC that provides a simple...

متن کامل

An Efficient Implementation of Nested Data Parallelism for Irregular Divide-and-Conquer Algorithms

This paper presents work in progress on a new method of implementing irregular divide-and-conquer algorithms in a nested data-parallel language model on distributedmemory multiprocessors. The main features discussed are the recursive subdivision of asynchronous processor groups to match the change from data-parallel to control-parallel behavior over the lifetime of an algorithm, switching from ...

متن کامل

A Scalable Dynamic Load-Balancing Algorithm for SPMD Applications on a Non-Dedicated Heterogeneous Network of Workstations (HNOW)

Network based distributed computing has become a cheap alternative to dedicated parallel supercomputer systems due to the recent advances in high-speed network. These systems are becoming widely available in academic and industrial environments. To benefit from the maximum computation power of these systems, it is necessary to use all available resources, namely old machines in addition to more...

متن کامل

Skeletons for Divide and Conquer Algorithms

Algorithmic skeletons intend to simplify parallel programming by providing recurring forms of program structure as predefined components. We present a fully distributed task parallel skeleton for a very general class of divide and conquer algorithms for MIMD machines with distributed memory. This approach is compared to a simple masterworker design. Based on experimental results for different e...

متن کامل

Implementation of a Region Growing Algorithm on Multicomputers: Analysis of the Work Load Balance

This paper discusses and evaluates different parallel implementations of a Region Growing algorithm based on the Split-and-Merge approach proposed by Horowitz and Pavlidis [1]. Region growing is a general technique for image segmentation. The basic scheme consists of joining adjacent pixels to form regions; adjacent regions are then merged to obtain larger regions. The solution has been conceiv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006